Big Biological Data

  • Genomic and trancriptomic sequencing
  • Molecular modeling and dynamics
  • Mass spectrometry

Challenges in visualizing big data😞?

Scatter Plot

Solutions

Network

Solutions

Heatmap

Data Visualization

  • Figure: a combination of geometric objects
    • dots
    • lines
    • areas
  • Geometric objects have attributes
    • location
    • color
    • size
    • shape
    • transparency
  • Data visualization: mapping categorical/numeric variables to geometric objects and their attributes

ggplot2

The Grammar of ggplot2

Geom: which geometric object to use?

Aesthetics: manipulate attributes of geometric objects

Coordinate system

Scale: adjust aesthetic mappings

Unique values of carb

## [1] 4 1 2 3 6 8

Theme: determine the plot appearance

Facet: split into multiple plots

Guides: axes and legends

Learning Goals

  • Several skills in big data visualization
  • The grammar of ggplot2
  • Manipulate aesthetics and plot appearance
  • Create publication-ready plots